SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: POTTER, ANDREW A.
REDMOND, MARK J.
HUGHES, HUW P. A.

(ii) TITLE OF INVENTION: ENHANCED IMMUNOGENICITY USING LEUKOTOXIN 
CHIMERAS 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ROBERTA L. ROBINS 

(B) STREET: 635 BRYANT STREET 

(C) CITY: PALO ALTO

(D) STATE: CALIFORNIA 

(E) COUNTRY: UNITED STATES OF AMERICA

(F) ZIP: 94301

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US 07/560,932

(B) FILING DATE: 14-OCT-1992

(C) CLASSIFICATION: 435 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: ROBINS, ROBERTA L.

(B) REGISTRATION NUMBER: 33,208
(C) REFERENCE/DOCKET NUMBER: 5000-0015.20

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (415) 617-8559

(B) TELEFAX: (415) 327-3231

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2794 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..2778

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ATGGCTACTG TTATAGATCT AAGCTTCCCA AAAACTGGGG CAAAAAAAAT TATCCTCTAT 60

ATTCCCCAAA ATTACCAATA TGATACTGAA CAAGGTAATG GTTTACAGGA TTTAGTCAAA 120 

GCGGCCGAAG AGTTGGGGAT TGAGGTACAA AGAGAAGAAC GCAATAATAT TGCAACAGCT 180

TTGAAAGCTG TTGAAGAAAT TATCGGTACA TCACATAACG ATATCTTTAA AGGTAGTAAG

TTCAATGATG CCTTTAACGG TGGTGATGGT GTCGATACTA TTGACGGTAA CGACGGCAAT 

GACCGCTTAT TTGGTGGTAA AGGCGATGAT ATTCTCGATG GTGGAAATGG TGATGATTTT 

ATCGATGGCG GTAAAGGCAA CGACCTATTA CACGGTGGCA AGGGCGATGA TATTTTCGTT

CACCGTAAAG GCGATGGTAA TGATATTATT ACCGATTCTG ACGGCAATGA TAAATTATCA 

TTCTCTGATT CGAACTTAAA AGATTTAACA TTTGAAAAAG TTAAACATAA TCTTGTCATC 

ACGAATAGCA AAAAAGAGAA AGTGACCATT CAAAACTGGT TCCGAGAGGC TGATTTTGCT

AAAGAAGTGC CTAATTATAA AGCAACTAAA GATGAGAAAA TCGAAGAAAT CATCGGTCAA 

AATGGCGAGC GGATCACCTC AAAGCAAGTT GATGATCTTA TCGCAAAAGG TAACGGCAAA 

ATTACCCAAG ATGAGCTATC AAAAGTTGTT GATAACTATG AATTGCTCAA ACATAGCAAA

AATGTGACAA ACAGCTTAGA TAAGTTAATC TCATCTGTAA GTGCATTTAC CTCGTCTAAT

GATTCGAGAA ATGTATTAGT GGCTCCAACT TCAATGTTGG ATCAAAGTTT ATCTTCTCTT

CAATTTGCTA GGGGATCCTA GCTAGCTAGC CATG

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GATCCAGCTC TTCTGCCGGC TGCAAAAACT TCTTCTGGAA AACCTTCACC AGCTGCTAGG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
GATCCCTAGC AGCTGGTGAA GGTTTTCCAG AAGAAGTTTT TGCAGCCGGC AGAAGAGCTG
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
GATCTCAGCA TTGGAGCTAC GGCCTGCGCC CTGGCTAAG 39
(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GATCCTTAGC CAGGGCGCAG GCCGTAGCTC CAATGCTGA 39 

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

GATCTTGCAA CATTGTGCCT GTGAGCATTG TGAGCCGCAA CATTGTGTAC ACCCGCGCGC 60

AACCTAACCA AGACATTGTG TAG 83

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

GATCCTACAC AATGTCTTGG TTAAGTTGCG CGCGGGTGTA CACAATGTTG CGGCTCACAA 60

TCGTCACAGG CACAATGTTG CAA 83
(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2829

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

ATGGCTACTG TTATAGATCT AAGCTTCCCA AAAACTGGGG CAAAAAAAAT TATCCTCTAT 60

ATTCCCCAAA ATTACCAATA TGATACTGAA CAAGGTAATG GTTTACAGGA TTTAGTCAAA 120

GCGGCCGAAG AGTTGGGGAT TGAGGTACAA AGAGAAGAAC GCAATAATAT TGCAACAGCT 180

CAAACCAGTT TAGGCACGAT TCAAACCGCT ATTGGCTTAA CTGAGCGTGG CATTGTGTTA 240

TCCGCTCCAC AAATTGATAA ATTGCTACAG AAAACTAAAG CAGGCCAAGC ATTAGGTTCT 300

GCCGAAAGCA TTGTACAAAA TGCAAATAAA GCCAAAACTG TATTATCTGG CATTCAATCT 360

ATTTTAGGCT CAGTATTGGC TGGAATGGAT TTAGATGAGG CCTTACAGAA TAACAGCAAC 420

CAACATGCTC TTGCTAAAGC TGGCTTGGAG CTAACAAATT CATTAATTGA AAATATTGCT 480

AATTCAGTAA AAACACTTGA CGAATTTGGT GAGCAAATTA GTCAATTTGG TTCAAAACTA 540

CAAAATATCA AAGGCTTAGG GACTTTAGGA GACAAACTCA AAAATATCGG TGGACTTGAT 600

AAAGCTGGCC TTGGTTTAGA TGTTATCTCA GGGCTATTAT CGGGCGCAAC AGCTGCACTT 660

GTACTTGCAG ATAAAAATGC TTCAACAGCT AAAAAAGTGG GTGCGGGTTT TGAATTGGCA 720

AACCAAGTTG TTGGTAATAT TACCAAAGCC GTTTCTTCTT ACATTTTAGC CCAACGTGTT 780

GCAGCAGGTT TATCTTCAAC TGGGCCTGTG GCTGCTTTAA TTGCTTCTAC TGTTTCTCTT 840

GCGATTAGCC CATTAGCATT TGCCGGTATT GCCGATAAAT TTAATCATGC AAAAAGTTTA 900

GAGAGTTATG CCGAACGCTT TAAAAAATTA GGCTATGACG GAGATAATTT ATTAGCAGAA 960

TATCAGCGGG GAACAGGGAC TATTGATGCA TCGGTTACTG CAATTAATAC CGCATTGGCC 1020

GCTATTGCTG GTGGTGTGTC TGCTGCTGCA GCCGGCTCGG TTATTGCTTC ACCGATTGCC 1080

TTATTAGTAT CTGGGATTAC CGGTGTAATT TCTACGATTC TGCAATATTC TAAACAAGCA 1140

ATGTTTGAGC ACGTTGCAAA TAAAATTCAT AACAAAATTG TAGAATGGGA AAAAAATAAT 1200

CACGGTAAGA ACTACTTTGA AAATGGTTAC GATGCCCGTT ATCTTGCGAA TTTACAAGAT 1260

AATATGAAAT TCTTACTGAA CTTAAACAAA GAGTTACAGG CAGAACGTGT CATCGCTATT 1320

ACTCAGCAGC AATGGGATAA CAACATTGGT GATTTAGCTG GTATTAGCCG TTTAGGTGAA 1380

AAAGTCCTTA GTGGTAAAGC CTATGTGGAT GCGTTTGAAG AAGGCAAACA CATTAAAGCC 1440

GATAAATTAG TACAGTTGGA TTCGGCAAAC GGTATTATTG ATGTGAGTAA TTCGGGTAAA 1500

GCGAAAACTC AGCATATCTT ATTCAGAACG CCATTATTGA CGCCGGGAAC AGAGCATCGT 1560

GAACGCGTAC AAACAGGTAA ATATGAATAT ATTACCAAGC TCAATATTAA CCGTGTAGAT 1620

AGCTGGAAAA TTACAGATGG TGCAGCAAGT TCTACCTTTG ATTTAACTAA CGTTGTTCAG 1680



CGTATTGGTA TTGAATTAGA CAATGCTGGA AATGTAACTA AAACCAAAGA AACAAAAATT 1740

ATTGCCAAAC TTGGTGAAGG TGATGACAAC GTATTTGTTG GTTCTGGTAC GACGGAAATT 1800

GATGGCGGTG AAGGTTACGA CCGAGTTCAC TATAGCCGTG GAAACTATGG TGCTTTAACT 1860

ATTGATGCAA CCAAAGAGAC CGAGCAAGGT AGTTATACCG TAAATCGTTT CGTAGAAACC 1920

GGTAAAGCAC TACACGAAGT GACTTCAACC CATACCGCAT TAGTGGGCAA CCGTGAAGAA 1980

AAAATAGAAT ATCGTCATAG CAATAACCAG CACCATGCCG GTTATTACAC CAAAGATACC 2040

TTGAAAGCTG TTGAAGAAAT TATCGGTACA TCACATAACG ATATCTTTAA AGGTAGTAAG 2100

TTCAATGATG CCTTTAACGG TGGTGATGGT GTCGATACTA TTGACGGTAA CGACGGCAAT 2160

GACCGCTTAT TTGGTGGTAA AGGCGATGAT ATTCTCGATG GTGGAAATGG TGATGATTTT 2220

ATCGATGGCG GTAAAGGCAA CGACCTATTA CACGGTGGCA AGGGCGATGA TATTTTCGTT 2280

CACCGTAAAG GCGATGGTAA TGATATTATT ACCGATTCTG ACGGCAATGA TAAATTATCA 2340

TTCTCTGATT CGAACTTAAA AGATTTAACA TTTGAAAAAG TTAAACATAA TCTTGTCATC 2400

ACGAATAGCA AAAAAGAGAA AGTGACCATT CAAAACTGGT TCCGAGAGGC TGATTTTGCT 2460

AAAGAAGTGC CTAATTATAA AGCAACTAAA GATGAGAAAA TCGAAGAAAT CATCGGTCAA 2520

AATGGCGAGC GGATCACCTC AAAGCAAGTT GATGATCTTA TCGCAAAAGG TAACGGCAAA 2580

ATTACCCAAG ATGAGCTATC AAAAGTTGTT GATAACTATG AATTGCTCAA ACATAGCAAA 2640

AATGTGACAA ACAGCTTAGA TAAGTTAATC TCATCTGTAA GTGCATTTAC CTCGTCTAAT 2700

GATTCGAGAA ATGTATTAGT GGCTCCAACT TCAATGTTGG ATCAAAGTTT ATCTTCTCTT 2760

CAATTTGCTA GGGGATCCAG CTCTTCTGCC GGCTGCAAAA ACTTCTTCTG GAAAACCTTC 2820

ACCAGCTGCT AGGGATCC 2838
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2817 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..2808

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

ATGGCTACTG TTATAGATCT AAGCTTCCCA AAAACTGGGG CAAAAAAAAT TATCCTCTAT 60

ATTCCCCAAA ATTACCAATA TGATACTGAA CAAGGTAATG GTTTACAGGA TTTAGTCAAA 120

GCGGCCGAAG AGTTGGGGAT TGAGGTACAA AGAGAAGAAC GCAATAATAT TGCAACAGCT 180

CAAACCAGTT TAGGCACGAT TCAAACCGCT ATTGGCTTAA CTGAGCGTGG CATTGTGTTA 240

TCCGCTCCAC AAATTGATAA ATTGCTACAG AAAACTAAAG CAGGCCAAGC ATTAGGTTCT 300

GCCGAAAGCA TTGTACAAAA TGCAAATAAA GCCAAAACTG TATTATCTGG CATTCAATCT 360

ATTTTAGGCT CAGTATTGGC TGGAATGGAT TTAGATGAGG CCTTACAGAA TAACAGCAAC 420

CAACATGCTC TTGCTAAAGC TGGCTTGGAG CTAACAAATT CATTAATTGA AAATATTGCT 480

AATTCAGTAA AAACACTTGA CGAATTTGGT GAGCAAATTA GTCAATTTGG TTCAAAACTA 540

CAAAATATCA AAGGCTTAGG GACTTTAGGA GACAAACTCA AAAATATCGG TGGACTTGAT 600

AAAGCTGGCC TTGGTTTAGA TGTTATCTCA GGGCTATTAT CGGGCGCAAC AGCTGCACTT 660

GTACTTGCAG ATAAAAATGC TTCAACAGCT AAAAAAGTGG GTGCGGGTTT TGAATTGGCA 720

AACCAAGTTG TTGGTAATAT TACCAAAGCC GTTTCTTCTT ACATTTTAGC CCAACGTGTT 780

GCAGCAGGTT TATCTTCAAC TGGGCCTGTG GCTGCTTTAA TTGCTTCTAC TGTTTCTCTT 840

GCGATTAGCC CATTAGCATT TGCCGGTATT GCCGATAAAT TTAATCATGC AAAAAGTTTA 900

GAGAGTTATG CCGAACGCTT TAAAAAATTA GGCTATGACG GAGATAATTT ATTAGCAGAA 960

TATCAGCGGG GAACAGGGAC TATTGATGCA TCGGTTACTG CAATTAATAC CGCATTGGCC 1020

GCTATTGCTG GTGGTGTGTC TGCTGCTGCA GCCGGCTCGG TTATTGCTTC ACCGATTGCC 1080

TTATTAGTAT CTGGGATTAC CGGTGTAATT TCTACGATTC TGCAATATTC TAAACAAGCA 1140

ATGTTTGAGC ACGTTGCAAA TAAAATTCAT AACAAAATTG TAGAATGGGA AAAAAATAAT 1200

CACGGTAAGA ACTACTTTGA AAATGGTTAC GATGCCCGTT ATCTTGCGAA TTTACAAGAT 1260

AATATGAAAT TCTTACTGAA CTTAAACAAA GAGTTACAGG CAGAACGTGT CATCGCTATT 1320

ACTCAGCAGC AATGGGATAA CAACATTGGT GATTTAGCTG GTATTAGCCG TTTAGGTGAA 1380

AAAGTCCTTA GTGGTAAAGC CTATGTGGAT GCGTTTGAAG AAGGCAAACA CATTAAAGCC 1440

GATAAATTAG TACAGTTGGA TTCGGCAAAC GGTATTATTG ATGTGAGTAA TTCGGGTAAA 1500

GCGAAAACTC AGCATATCTT ATTCAGAACG CCATTATTGA CGCCGGGAAC AGAGCATCGT 1560

GAACGCGTAC AAACAGGTAA ATATGAATAT ATTACCAAGC TCAATATTAA CCGTGTAGAT 1620

AGCTGGAAAA TTACAGATGG TGCAGCAAGT TCTACCTTTG ATTTAACTAA CGTTGTTCAG 1680

CGTATTGGTA TTGAATTAGA CAATGCTGGA AATGTAACTA AAACCAAAGA AACAAAAATT 1740

ATTGCCAAAC TTGGTGAAGG TGATGACAAC GTATTTGTTG GTTCTGGTAC GACGGAAATT 1800

GATGGCGGTG AAGGTTACGA CCGAGTTCAC TATAGCCGTG GAAACTATGG TGCTTTAACT 1860

ATTGATGCAA CCAAAGAGAC CGAGCAAGGT AGTTATACCG TAAATCGTTT CGTAGAAACC 1920

GGTAAAGCAC TACACGAAGT GACTTCAACC CATACCGCAT TAGTGGGCAA CCGTGAAGAA 1980

AAAATAGAAT ATCGTCATAG CAATAACCAG CACCATGCCG GTTATTACAC CAAAGATACC 2040

TTGAAAGCTG TTGAAGAAAT TATCGGTACA TCACATAACG ATATCTTTAA AGGTAGTAAG 2100

TTCAATGATG CCTTTAACGG TGGTGATGGT GTCGATACTA TTGACGGTAA CGACGGCAAT 2160

GACCGCTTAT TTGGTGGTAA AGGCGATGAT ATTCTCGATG GTGGAAATGG TGATGATTTT 2220

ATCGATGGCG GTAAAGGCAA CGACCTATTA CACGGTGGCA AGGGCGATGA TATTTTCGTT 2280

CACCGTAAAG GCGATGGTAA TGATATTATT ACCGATTCTG ACGGCAATGA TAAATTATCA 2340

TTCTCTGATT CGAACTTAAA AGATTTAACA TTTGAAAAAG TTAAACATAA TCTTGTCATC 2400

ACGAATAGCA AAAAAGAGAA AGTGACCATT CAAAACTGGT TCCGAGAGGC TGATTTTGCT 2460

AAAGAAGTGC CTAATTATAA AGCAACTAAA GATGAGAAAA TCGAAGAAAT CATCGGTCAA 2520

AATGGCGAGC GGATCACCTC AAAGCAAGTT GATGATCTTA TCGCAAAAGG TAACGGCAAA 2580

ATTACCCAAG ATGAGCTATC AAAAGTTGTT GATAACTATG AATTGCTCAA ACATAGCAAA 2640

AATGTGACAA ACAGCTTAGA TAAGTTAATC TCATCTGTAA GTGCATTTAC CTCGTCTAAT 2700

GATTCGAGAA ATGTATTAGT GGCTCCAACT TCAATGTTGG ATCAAAGTTT ATCTTCTCTT 2760

CAATTTGCTA GGGGATCTCA GCATTGGAGC TACGGCCTGC GCCCTGGCTA AGGATCC 2817
(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2861 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2853



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGGCTACTG TTATAGATCT AAGCTTCCCA AAAACTGGGG CAAAAAAAAT TATCCTCTAT 60

ATTCCCCAAA ATTACCAATA TGATACTGAA CAAGGTAATG GTTTACAGGA TTTAGTCAAA 120

GCGGCCGAAG AGTTGGGGAT TGAGGTACAA AGAGAAGAAC GCAATAATAT TGCAACAGCT 180

CAAACCAGTT TAGGCACGAT TCAAACCGCT ATTGGCTTAA CTGAGCGTGG CATTGTGTTA 240

TCCGCTCCAC AAATTGATAA ATTGCTACAG AAAACTAAAG CAGGCCAAGC ATTAGGTTCT 300

GCCGAAAGCA TTGTACAAAA TGCAAATAAA GCCAAAACTG TATTATCTGG CATTCAATCT 360

ATTTTAGGCT CAGTATTGGC TGGAATGGAT TTAGATGAGG CCTTACAGAA TAACAGCAAC 420

CAACATGCTC TTGCTAAAGC TGGCTTGGAG CTAACAAATT CATTAATTGA AAATATTGCT 480

AATTCAGTAA AAACACTTGA CGAATTTGGT GAGCAAATTA GTCAATTTGG TTCAAAACTA 540

CAAAATATCA AAGGCTTAGG GACTTTAGGA GACAAACTCA AAAATATCGG TGGACTTGAT 600



AAAGCTGGCC TTGGTTTAGA TGTTATCTCA GGGCTATTAT CGGGCGCAAC AGCTGCACTT 660

GTACTTGCAG ATAAAAATGC TTCAACAGCT AAAAAAGTGG GTGCGGGTTT TGAATTGGCA 720

AACCAAGTTG TTGGTAATAT TACCAAAGCC GTTTCTTCTT ACATTTTAGC CCAACGTGTT 780

GCAGCAGGTT TATCTTCAAC TGGGCCTGTG GCTGCTTTAA TTGCTTCTAC TGTTTCTCTT 840

GCGATTAGCC CATTAGCATT TGCCGGTATT GCCGATAAAT TTAATCATGC AAAAAGTTTA 900

GAGAGTTATG CCGAACGCTT TAAAAAATTA GGCTATGACG GAGATAATTT ATTAGCAGAA 960

TATCAGCGGG GAACAGGGAC TATTGATGCA TCGGTTACTG CAATTAATAC CGCATTGGCC 1020

GCTATTGCTG GTGGTGTGTC TGCTGCTGCA GCCGGCTCGG TTATTGCTTC ACCGATTGCC 1080

TTATTAGTAT CTGGGATTAC CGGTGTAATT TCTACGATTC TGCAATATTC TAAACAAGCA 1140

ATGTTTGAGC ACGTTGCAAA TAAAATTCAT AACAAAATTG TAGAATGGGA AAAAAATAAT 1200

CACGGTAAGA ACTACTTTGA AAATGGTTAC GATGCCCGTT ATCTTGCGAA TTTACAAGAT 1260

AATATGAAAT TCTTACTGAA CTTAAACAAA GAGTTACAGG CAGAACGTGT CATCGCTATT 1320

ACTCAGCAGC AATGGGATAA CAACATTGGT GATTTAGCTG GTATTAGCCG TTTAGGTGAA 1380

AAAGTCCTTA GTGGTAAAGC CTATGTGGAT GCGTTTGAAG AAGGCAAACA CATTAAAGCC 1440

GATAAATTAG TACAGTTGGA TTCGGCAAAC GGTATTATTG ATGTGAGTAA TTCGGGTAAA 1500

GCGAAAACTC AGCATATCTT ATTCAGAACG CCATTATTGA CGCCGGGAAC AGAGCATCGT 1560

GAACGCGTAC AAACAGGTAA ATATGAATAT ATTACCAAGC TCAATATTAA CCGTGTAGAT 1620

AGCTGGAAAA TTACAGATGG TGCAGCAAGT TCTACCTTTG ATTTAACTAA CGTTGTTCAG 1680

CGTATTGGTA TTGAATTAGA CAATGCTGGA AATGTAACTA AAACCAAAGA AACAAAAATT 1740

ATTGCCAAAC TTGGTGAAGG TGATGACAAC GTATTTGTTG GTTCTGGTAC GACGGAAATT 1800

GATGGCGGTG AAGGTTACGA CCGAGTTCAC TATAGCCGTG GAAACTATGG TGCTTTAACT 1860

ATTGATGCAA CCAAAGAGAC CGAGCAAGGT AGTTATACCG TAAATCGTTT CGTAGAAACC 1920

GGTAAAGCAC TACACGAAGT GACTTCAACC CATACCGCAT TAGTGGGCAA CCGTGAAGAA 1980

AAAATAGAAT ATCGTCATAG CAATAACCAG CACCATGCCG GTTATTACAC CAAAGATACC 2040

TTGAAAGCTG TTGAAGAAAT TATCGGTACA TCACATAACG ATATCTTTAA AGGTAGTAAG 2100

TTCAATGATG CCTTTAACGG TGGTGATGGT GTCGATACTA TTGACGGTAA CGACGGCAAT 2160

GACCGCTTAT TTGGTGGTAA AGGCGATGAT ATTCTCGATG GTGGAAATGG TGATGATTTT 2220

ATCGATGGCG GTAAAGGCAA CGACCTATTA CACGGTGGCA AGGGCGATGA TATTTTCGTT 2280

CACCGTAAAG GCGATGGTAA TGATATTATT ACCGATTCTG ACGGCAATGA TAAATTATCA 2340

TTCTCTGATT CGAACTTAAA AGATTTAACA TTTGAAAAAG TTAAACATAA TCTTGTCATC 2400

ACGAATAGCA AAAAAGAGAA AGTGACCATT CAAAACTGGT TCCGAGAGGC TGATTTTGCT 2460

AAAGAAGTGC CTAATTATAA AGCAACTAAA GATGAGAAAA TCGAAGAAAT CATCGGTCAA 2520

AATGGCGAGC GGATCACCTC AAAGCAAGTT GATGATCTTA TCGCAAAAGG TAACGGCAAA 2580

ATTACCCAAG ATGAGCTATC AAAAGTTGTT GATAACTATG AATTGCTCAA ACATAGCAAA 2640

AATGTGACAA ACAGCTTAGA TAAGTTAATC TCATCTGTAA GTGCATTTAC CTCGTCTAAT 2700

GATTCGAGAA ATGTATTAGT GGCTCCAACT TCAATGTTGG ATCAAAGTTT ATCTTCTCTT 2760

CAATTTGCTA GGGGATCTTG CAACATTGTG CCTGTGAGCA TTGTGAGCCG CAACATTGTG 2820

TACACCCGCG CGCAACCTAA CCAAGACATT GTGTAGGATC C 2861


(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 3

(D) OTHER INFORMATION: /note= "The amino acid at this
location can be either Lys, Asp, Val or Asn."

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /note= "The amino acid at this
location can be either Lys, Asp, Val or Asn."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:



Gly Gly Xaa Gly Xaa Asp
1               5